Multilingual Autoregressive Entity Linking

نویسندگان

چکیده

Abstract We present mGENRE, a sequence-to- sequence system for the Multilingual Entity Linking (MEL) problem—the task of resolving language-specific mentions to multilingual Knowledge Base (KB). For mention in given language, mGENRE predicts name target entity left-to-right, token-by-token an autoregressive fashion. The formulation allows us effectively cross-encode string and names capture more interactions than standard dot product between vectors. It also enables fast search within large KB even that do not appear tables with no need large-scale vector indices. While prior MEL works use single representation each entity, we match against as many languages possible, which exploiting language connections source input name. Moreover, zero-shot setting on training data at all, treats latent variable is marginalized prediction time. This leads over 50% improvements average accuracy. show efficacy our approach through extensive evaluation including experiments three popular benchmarks where establish new state-of-the-art results. Source code available https://github.com/facebookresearch/GENRE.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multilingual Word Sense Disambiguation and Entity Linking

Nowadays the textual information available online is provided in an increasingly wide range of languages. This language explosion clearly forces researchers to focus on the challenging problem of being able to analyze and understand text written in any language. At the core of this problem lies the lexical ambiguity of language, an issue which is addressed by two key tasks in computational lexi...

متن کامل

Multilingual Entity Linking: Comparing English and Spanish

The Entity Linking (EL) task is concerned with linking entity mentions in a text collection with their corresponding knowledgebase entries. The majority of approaches have focused on EL over English text collections. However, some approaches propose language-independent or multilingual approaches to perform EL over texts in many languages. In this paper, our goal is to see how well EL systems p...

متن کامل

DeepType: Multilingual Entity Linking by Neural Type System Evolution

The wealth of structured (e.g. Wikidata) and unstructured data about the world available today presents an incredible opportunity for tomorrow’s Artificial Intelligence. So far, integration of these two different modalities is a difficult process, involving many decisions concerning how best to represent the information so that it will be captured or useful, and hand-labeling large amounts of d...

متن کامل

Multilingual Word Sense Disambiguation and Entity Linking for Everybody

In this paper we present a Web interface and a RESTful API for our state-of-the-art multilingual word sense disambiguation and entity linking system. The Web interface has been developed, on the one hand, to be user-friendly for non-specialized users, who can thus easily obtain a first grasp on complex linguistic problems such as the ambiguity of words and entity mentions and, on the other hand...

متن کامل

UBC Entity Discovery and Linking & Diagnostic Entity Linking

This paper describe the runs submitted by the UBC team at TAC-KBP 2014 for both English Entity Discovery and Linking (EDL) and Diagnostic Entity Linking (DEL) tasks. Our main interest was to compare the performance between two totally different name entity recognizer systems and to combine them with three different name entity disambiguation systems that were developed for the TACKBP 2013 EL ta...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Transactions of the Association for Computational Linguistics

سال: 2022

ISSN: ['2307-387X']

DOI: https://doi.org/10.1162/tacl_a_00460